Implement coalesced pooling over entire batches #368
Merged
Description
This PR ports a feature from
curated-transformers
that applies the pooling operation (which reduces the piece representations to token-level representations) to entire batches instead of individual Docs. This significantly reduces the overhead of launching the custom kernel behind the scenes, especially in high-throughput scenarios such as inference.

This change improves the GPU inference performance of the German transformer model (minus the trainable lemmatizer) by 32.5% (20171.7 WPS -> 26725.9 WPS). GPU training speed also sees a modest improvement of 4.6% (3547.7 WPS -> 3713 WPS).
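The idea can be sketched as follows. This is a minimal NumPy illustration with hypothetical helper names, not the actual curated-transformers implementation (which runs a custom GPU kernel): instead of pooling each Doc's piece representations separately, the pieces of all Docs are concatenated, their token indices are offset, and a single pooling call covers the whole batch.

```python
import numpy as np

def mean_pool(pieces, token_ids, n_tokens):
    """Mean-pool piece vectors into token vectors.

    pieces: (n_pieces, dim) array of piece representations.
    token_ids: (n_pieces,) array mapping each piece to its token index.
    n_tokens: total number of tokens.
    """
    sums = np.zeros((n_tokens, pieces.shape[1]), dtype=pieces.dtype)
    np.add.at(sums, token_ids, pieces)  # scatter-add pieces into tokens
    counts = np.bincount(token_ids, minlength=n_tokens)[:, None]
    return sums / counts

# Each doc is (pieces, token_ids, n_tokens).

def pool_per_doc(docs):
    # One pooling call (and, on GPU, one kernel launch) per Doc.
    return [mean_pool(p, t, n) for p, t, n in docs]

def pool_batched(docs):
    # Concatenate all pieces and offset each Doc's token indices so that
    # a single pooling call covers the entire batch; then split the
    # result back into per-Doc arrays.
    pieces = np.concatenate([p for p, _, _ in docs])
    offsets = np.cumsum([0] + [n for _, _, n in docs[:-1]])
    token_ids = np.concatenate(
        [t + off for (_, t, _), off in zip(docs, offsets)]
    )
    n_total = sum(n for _, _, n in docs)
    pooled = mean_pool(pieces, token_ids, n_total)
    bounds = np.cumsum([n for _, _, n in docs])[:-1]
    return np.split(pooled, bounds)
```

Both functions produce identical token representations; the batched variant simply replaces many small pooling launches with one large one, which is where the throughput gain comes from.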
Types of change
Feature
Checklist